Rhetorical Structure Analysis of Japanese Patent Claims using Cue Phrases
نویسندگان
چکیده
The most important part of patent specification is where the claims are written. It is common that claims written in Japanese are described in one sentence with peculiar style and are difficult to understand for ordinary people. We are investigating NLP technologies to improve readability of patent claims. To do so, it is necessary to present the structure of patent claims in a readable way. We found that there are several typical phrases used in claim descriptions and that they can be used as clues to analyze the rhetorical structure of patent claims. We propose a method to analyze the rhetorical structure of patent claims by using these cue phrases and report the result of evaluation.
منابع مشابه
The Rhetorical Parsing of Natural Language Texts
We derive the rhetorical structures of texts by means of two new, surface-form-based algorithms: one that identifies discourse usages of cue phrases and breaks sentences into clauses, and one that produces valid rhetorical structure trees for unrestricted natural language texts. The algorithms use information that was derived from a corpus analysis of cue phrases.
متن کاملUsing Cohesive Devices to Recognize Rhetorical Relations in Text
This paper investigates factors that can be used in discourse analysis, specifically, cohesive devices. The paper shows that cohesive devices such as cue phrases can provide information about the linkages inside a text. We propose three types of cue phrases (the ordinary cue phrases, noun-phrase cues, and verb-phrase cues). An algorithm to compute rhetorical relations between two elementary dis...
متن کاملBeyond String Matching and Cue Phrases: Improving Efficiency and Coverage in Discourse Analysis
RASTA (Rhetorical Structure Theory Analyzer), a discourse analysis component within the Microsoft English Grammar, efficiently computes representations of the structure of written discourse using information available in syntactic and logical form analyses. RASTA heuristically scores the rhetorical relations that it hypothesizes, using those scores to guide it in producing more plausible discou...
متن کاملThe Rhetorical Parsing of Unrestricted Texts: A Surface-Based Approach
Coherent texts are not just simple sequences of clauses and sentences, but rather complex artifacts that have highly elaborate rhetorical structure. This paper explores the extent to which well-formed rhetorical structures can be automatically derived by means of surface-form-based algorithms. These algorithms identify discourse usages of cue phrases and break sentences into clauses, hypothesiz...
متن کاملDiscursive Usage of Six Chinese Punctuation Marks
Both rhetorical structure and punctuation have been helpful in discourse processing. Based on a corpus annotation project, this paper reports the discursive usage of 6 Chinese punctuation marks in news commentary texts: Colon, Dash, Ellipsis, Exclamation Mark, Question Mark, and Semicolon. The rhetorical patterns of these marks are compared against patterns around cue phrases in general. Result...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002